(2) INFORMATION FOR SEQ ID NO: 1:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1858 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:

CTCGGGTAAC TTCTTGAGCG CGGCCACAGC AGCCTTGATC ATGAAGGCGA GCATGGTGAC 6 0 

CTTGACGCCG CTCTTTTCGT TCTCTTTGTT GAACTGCACG CGAAAGGCTT CCAGGTCGGT 12 0 

GATGTCCGCG TCGTCGTGGT TGGTGACGTG CGGGATGACC ACCCAGTTGC GGTGCAGGTT 180 

TTTCGATGGC ATAATATCTG CGTTGCGACG TGTAACACAC TATTGGAGAC ATATCATGCA 24 0 

AACGCTCAGC ATCCAGCACG GTACCCTCGT CACGATGGAT CAGTACCGCA GAGTCCTTGG 300 

GGATAGCTGG GTTCACGTGC AGGATGGACG GATCGTCGCG CTCGGAGTGC ACGCCGAGTC 3 60 

GGTGCCTCCG CCAGCGGATC GGGTGATCGA TGCACGCGGC AAGGTCGTGT TACCCGGTTT 42 0 

CATCAATGCC CACACCCATG TGAACCAGAT CCTCCTGCGC GGAGGGCCCT CGCACGGACG 480 

TCAATTCTAT GACTGGCTGT TCAACGTTGT GTATCCGGGA CAAAAGGCGA TGAGACCGGA 54 0 

GGACGTAGCG GTGGCGGTGA GGTTGTATTG TGCGGAAGCT GTGCGCAGCG GGATTACGAC 600 

GATCAACGAA AACGCCGATT CGGCCATCTA CCCAGGCAAC ATCGAGGCCG CGATGGCGGT 660 

CTATGGTGAG GTGGGTGTGA GGGTCGTCTA CGCCCGCATG TTCTTTGATC GGATGGACGG 720 

GCGCATTCAA GGGTATGTGG ACGCCTTGAA GGCTCGCTCT CCCCAAGTCG AACTGTGCTC 780 

GATCATGGAG GAAACGGCTG TGGCCAAAGA TCGGATCACA GCCCTGTCAG ATCAGTATCA 84 0 

TGGCACGGCA GGAGGTCGTA TATCAGTTTG GCCCGCTCCT GCCACTACCA CGGCGGTGAC 900 

AGTTGAAGGA ATGCGATGGG CACAAGCCTT CGCCCGTGAT CGGGCGGTAA TGTGGACGCT 96 0 

TCACATGGCG GAGAGCGATC ATGATGAGCG GATTCATGGG ATGAGTCCCG CCGAGTACAT 102 0 

GGAGTGTTAC GGACTCTTGG ATGAGCGTCT GCAGGTCGCG CATTGCGTGT ACTTTGACCG 10 8 0 

GAAGGATGTT CGGCTGCTGC ACCGCCACAA TGTGAAGGTC GCGTCGCAGG TTGTGAGCAA 114 0 

TGCCTACCTC GGCTCAGGGG TGGCCCCCGT GCCAGAGATG GTGGAGCGCG GCATGGCCGT 12 0 0 

GGGCATTGGA ACAGATAACG GGAATAGTAA TGACTCCGCA AACATGATCG GAGACATGAA 12 6 0 

GTTTATGGCC CATATTCACC GCGCGGTGCA TCGGGATGCG GACGTGCTGA CCCCAGAGAA 13 2 0 

GATTCTTGAA ATGGCGACGA TCGATGGGGC GCGTTCGTTG GGAATGGACC ACGAGATTGG 13 8 0 

TTCCATCGAA ACCGGCAAGC GCGCGGACCT TATCCTGCTT GACCTGCGTC ACCTCAGACG 144 0 






ACTCTCACAT CATTTGGCGG CCACGATCGT GTTTCAGGCT TACGGCAATG AGGTGGACAC 15 00 

TGTCCTGATT GACGGAAACG TTGTGATGGA GAACCGCCGC TTGAGCTTTC TTCCCCCTGA 15 6 0 

ACGTGAGTTG GCGTTCCTTG AGGAAGCGCA GAGCCGCGCC ACAGCTATTT TGCAGCGGGC 162 0 

GAACATGGTG GCTAACCCAG CTTGGCGCAG CCTCTAGGAA ATGACGCCGT TGCTGCATCC 168 0 

GCCGCCCCTT GAGGAAATCG CTGCCATCTT GGCGCGGCTC GGATTGGGGG GCGGACATGA 174 0 

CCTTGATGGA TACAGAATTG CCATGAATGC GGCACTTCCG TCCTTCGCTC GTGTGGAATC 18 0 0 

GTTGGTAGGT GAGGGTCGAC TGCGGGCGCC AGCTTCCCGA AGAGGTGAAA GGCCCGAG 185 8 
(2) INFORMATION FOR SEQ ID NO: 2:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 473 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:

Met Gln Thr Leu Ser Ile Gln His Gly Thr Leu Val Thr Met Asp Gln
1 5 10 15

Tyr Arg Arg Val Leu Gly Asp Ser Trp Val His Val Gln Asp Gly Arg
20 25 30

Ile Val Ala Leu Gly Val His Ala Glu Ser Val Pro Pro Pro Ala Asp
35 40 45

Arg Val Ile Asp Ala Arg Gly Lys Val Val Leu Pro Gly Phe Ile Asn
50 55 60

Ala His Thr His Val Asn Gln Ile Leu Leu Arg Gly Gly Pro Ser His
65 70 75 80

Gly Arg Gln Phe Tyr Asp Trp Leu Phe Asn Val Val Tyr Pro Gly Gln
85 90 95

Lys Ala Met Arg Pro Glu Asp Val Ala Val Ala Val Arg Leu Tyr Cys
100 105 110

Ala Glu Ala Val Arg Ser Gly Ile Thr Thr Ile Asn Glu Asn Ala Asp
115 120 125

Ser Ala Ile Tyr Pro Gly Asn Ile Glu Ala Ala Met Ala Val Tyr Gly
130 135 140

Glu Val Gly Val Arg Val Val Tyr Ala Arg Met Phe Phe Asp Arg Met
145 150 155 160

Asp Gly Arg Ile Gln Gly Tyr Val Asp Ala Leu Lys Ala Arg Ser Pro
165 170 175

Gln Val Glu Leu Cys Ser Ile Met Glu Glu Thr Ala Val Ala Lys Asp
180 185 190

Arg Ile Thr Ala Leu Ser Asp Gln Tyr His Gly Thr Ala Gly Gly Arg
195 200 205

Ile Ser Val Trp Pro Ala Pro Ala Thr Thr Thr Ala Val Thr Val Glu
210 215 220

Gly Met Arg Trp Ala Gln Ala Phe Ala Arg Asp Arg Ala Val Met Trp
225 230 235 240

Thr Leu His Met Ala Glu Ser Asp His Asp Glu Arg Ile His Gly Met
245 250 255

Ser Pro Ala Glu Tyr Met Glu Cys Tyr Gly Leu Leu Asp Glu Arg Leu
260 265 270

Gln Val Ala His Cys Val Tyr Phe Asp Arg Lys Asp Val Arg Leu Leu
275 280 285

His Arg His Asn Val Lys Val Ala Ser Gln Val Val Ser Asn Ala Tyr
290 295 300

Leu Gly Ser Gly Val Ala Pro Val Pro Glu Met Val Glu Arg Gly Met
305 310 315 320

Ala Val Gly Ile Gly Thr Asp Asn Gly Asn Ser Asn Asp Ser Ala Asn
325 330 335

Met Ile Gly Asp Met Lys Phe Met Ala His Ile His Arg Ala Val His
340 345 350

Arg Asp Ala Asp Val Leu Thr Pro Glu Lys Ile Leu Glu Met Ala Thr
355 360 365

Ile Asp Gly Ala Arg Ser Leu Gly Met Asp His Glu Ile Gly Ser Ile
370 375 380

Glu Thr Gly Lys Arg Ala Asp Leu Ile Leu Leu Asp Leu Arg His Leu
385 390 395 400

Arg Arg Leu Ser His His Leu Ala Ala Thr Ile Val Phe Gln Ala Tyr
405 410 415

Gly Asn Glu Val Asp Thr Val Leu Ile Asp Gly Asn Val Val Met Glu
420 425 430

Asn Arg Arg Leu Ser Phe Leu Pro Pro Glu Arg Glu Leu Ala Phe Leu
435 440 445

Glu Glu Ala Gln Ser Arg Ala Thr Ala Ile Leu Gln Arg Ala Asn Met
450 455 460

Val Ala Asn Pro Ala Trp Arg Ser Leu
465 470

(2) INFORMATION FOR SEQ ID NO: 3:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1808 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3:

GCGAGCATGG TGACCTTGAC GCCGCTCTTT TCGTTCTCTT TGTTGAACTG CACGCGAAAG 6 0 

GCTTCCAGGT CGGTGATGTC CGCGTCGTCG TGGTTGGTGA CGTGCGGGAT GACCACCCAG 12 0 

TTGCGGTGCA GGTTTTTCGA TGGCATAATA TCTGCGTTGC GACGTGTAAC ACACTATTGG 180 

AGACATATCA TGCAAACGCT CAGCATCCAG CACGGTACCC TCGTCACGAT GGATCAGTAC 24 0 

CGCAGAGTCC TTGGGGATAG CTGGGTTCAC GTGCAGGATG GACGGATCGT CGCGCTCGGA 3 00 

GTGCACGCCG AGTCGGTGCC TCCGCCAGCG GATCGGGTGA TCGATGCACG CGGCAAGGTC 36 0 

GTGTTACCCG GTTTCATCAA TGCCCACACC CATGTGAACC AGATCCTCCT GCGCGGAGGG 42 0 

CCCTCGCACG GGCGTCAATT CTATGACTGG CTGTTCAACG TTGTGTATCC GGGACAAAAG 48 0 

GCGATGAGAC CGGAGGACGT AGCGGTGGCG GTGAGGTTGT ATTGTGCGGA AGCTGTGCGC 54 0 

AGCGGGATTA CGACGATCAA CGAAAACGCC GATTCGGCCA TCTACCCAGG CAACATCGAG 6 00 

GCCGCGATGG CGGTCTATGG TGAGGTGGGT GTGAGGGTCG TCTACGCCCG CATGTTCTTT 66 0 

GATCGGATGG ACGGGCGCAT TCAAGGGTAT GTGGACGCCT TGAAGGCTCG CTCTCCCCAA 72 0 

GTCGAACTGT GCTCGATCAT GGAGGGAACG GCTGTGGCCA AAGATCGGAT CACAGCCCTG 78 0 

TCAGATCAGT ATCATGGCAC GGCAGGAGGT CGTATATCAG TTTGGCCCGC TCCTGCCACT 84 0 

ACCACGGCGG TGACAGTTGA AGGAATGCGA TGGGCACAAG CCTTCGCCCG TGATCGGGCG 90 0 

GTAATGTGGA CGCTTCACAT GGCGGAGAGC GATCATGATG AGCGGATTCA TGGGATGAGT 96 0 

CCCGCCGAGT ACATGGAGTG TTACGGACTC TTGGATGAGC GTCTGCAGGT CGCGCATTGC 102 0 

GTGTACTTTG ACCGGAAGGA TGTTCGGCTG CTGCACCGCC ACAATGTGAA GGTCGCGTCG 108 0 

CAGGTTGTGA GCAATGCCTA CCTCGGCTCA GGGGTGGCCC CCGTGCCAGA GATGGTGGAG 114 0 

CGCGGCATGG CCGTGGGCAT TGGAACAGAT AACGGGAATA GTAATGACTC CGTAAACATG 120 0 



ATCGGAGACA TGAAGTTTAT GGCCCATATT CACCGCGCGG TGCATCGGGA TGCGGACGTG 1260 

CTGACCCCAG AGAAGATTCT TGAAATGGCG ACGATCGATG GGGCGCGTTC GTTGGGAATG 13 20 

GACCACGAGA TTGGTTCCAT CGAAACCGGC AAGCGCGCGG ACCTTATCCT GCTTGACCTG 13 8 0 

CGTCACCCTC AGACGACTCC TCACCATCAT TTGGCGGCCA CGATCGTGTT TCAGGCTTAC 144 0 

GGCAATGAGG TGGACACTGT CCTGATTGAC GGAAACGTTG TGATGGAGAA CCGCCGCTTG 15 00 

AGCTTTCTTC CCCCTGAACG TGAGTTGGCG TTCCTTGAGG AAGCGCAGAG CCGCGCCACA 1560 

GCTATTTTGC AGCGGGCGAA CATGGTGGCT AACCCAGCTT GGCGCAGCCT CTAGGAAATG 1620 

ACGCCGTTGC TGCATCCGCC GCCCCTTGAG GAAATCGCTG CCATCTTGGC GCGGCTCGGA 16 80 

TTGGGGGGCG GACATGACCT TGATGGATAC AGAATTGCCA TGAATGCGGC ACTTCCGTCC 174 0 

TTCGCTCGTG TGGAATCGTT GGTAGGTGAG GGTCGACTGC GGGCGCCAGC TTCCCGAAGA 18 00 

AGTGAAAG 1808 
(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1846 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4:

GAGCGCCGCC ACAGCAGCCT TGATCATGAA GGCGAGCATG GTGACCTTGA CGCCGCTCTT 6 0 

TTCGTTCTCT TTGTTGAACT GCACGCGAAA GGCTTCCAGG TCGGTGATGT CCGCGTCGTC 120 

GTGGTTGGTG ACGTGCGGGA TGACCACCCA GTTGCGGTGC AGGTTTTTCG ATGGCGTAAT 18 0 

ATCTGCGTTG CGACGTGTAA CACACTATTG GAGACATATC ATGCAAACGC TCAGCATCCA 24 0 

GCACGGTACC CTCGTCACGA TGGATCAGTA CCGCAGAGTC CTTGGGGATA GCTGGGTTCA 3 00 

CGTGCAGGAT GGACGGATCG TCGCGCTCGG AGTGCACGCC GAGTCGGTGC CTCCGCCAGC 3 60 

GGATCGGGTG ATCGATGCAC GCGGCAAGGT CGTGTTACCC GGTTTCATCA ATGCCCACAC 42 0 

CCATGTGAAC CAGATCCTCC TGCGCGGAGG GCCCTCGCAC GGGCGTCAAT TCTATGACTG 48 0 

GCTGTTCAAC GTTGTGTATC CGGGACAAAA GGCGATGAGA CCGGAGGACG TAGCGGTGGC 54 0 

GGTGAGGTTG TATTGTGCGG AAGCTGTGCG CAGCGGGATT ACGACGATCA ACGAAAACGC 6 00 

CGATTCGGCC ATCTACCCAG GCAACATCGA GGCCGCGATG GCGGTCTATG GTGAGGTGGG 660 

TGTGAGGGTC GTCTACGCCC GCATGTTCTT TGATCGGATG GACGGGCGCA TTCAAGGGTA 72 0 



TGTGGACGCC TTGAAGGCTC GCTCTCCCCA AGTCGAACTG TGCTCGATCA TGGAGGAAAC 78 0 

GGCTGTGGCC AAAGATCGGA TCACAGCCCT GTCAGATCAG TATCATGGCA CGGCAGGAGG 84 0 

TCGTATATCA GTTTGGCCCG CTCCTGCCAC TACCACGGCG GTGACAGTTG AAGGAATGCG 90 0 

ATGGGCACAA GCCTTCGCCC GTGATCGGGC GGTAATGTGG ACGCTTCACA TGGCGGAGAG 96 0 

CGATCATGAT GAGCGGATTC ATGGGATGAG TCCCGCCGAT TACATGGAGT GTTACGGACT 102 0 

CTTGGATGAG CGTCTGCAGG TCGCGCATTG CGTGTACTTT GACCGGAAGG ATGTTCGGCT 10 8 0 

GCTGCACCGC CACAATGTGA AGGTCGCGTC GCAGGTTGTG AGCAATGCCT ACCTCGGCTC 114 0 

AGGGGTGGCC CCCGTGCCAG AGATGGTGGA GCGCGGCATG GCCGTGGGCA TTGGAACAGA 12 0 0 

TAACGGGAAT AGTAATGACT CCGTAAACAT GATCGGAGAC ATGAAGTTTA TGGCCCATAT 12 60 

TCACCGCGCG GTGCATCGGG ATGCGGACGT GCTGACCCCA GAGAAGATTC TTGAAATGGC 13 2 0 

GACGATCGAT GGGGCGCGTT CGTTGGGGAT GGACCACGAG ATTGGTTCCA TCGAAACCGG 13 8 0 

CAAGCGCGCG GACCTTATCC TGCTTGACCT GCGTCACCCT CAGACGACTC CTCACCATCA 1440 

TTTGGCGGCC ACGATCGTGT TTCAGGCTTA CGGCAATGAG GTGGACACTG TCCTGATTGA 1500 

CGGAAACGTT GTGATGGAGA ACCGCCGCTT GAGCTTTCTT CCCCCTGAAC GTGAGTTGGC 1560 

GTTCCTTGAG GAAGCGCAGA GCCGCGCCAC AGCTATTTTG CAGCGGGCGA ACATGGTGGC 162 0 

TAACCCAGCT TGGCGCAGCC TCTAGGAAAT GACGCCGTTG CTGCATCCGC CGCCCCTTGA 1680 

GGAAATCGCT GCCATCTTGG CGCGGCTCGG ATTGGGGGGC GGACATGACC TTGATGGATA 174 0 

CAGAATTGCC ATGAATGCGG CACTTCCGTC CTTCGCTCGT GTGGAATCGT TGGTAGGTGA 1800 

GGGTCGACTG CGGGCGCCAG CTTCCCGAAG AAGTGAAAGG CCCGAG 1846 
(2) INFORMATION FOR SEQ ID NO: 5:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 601 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5:

Ala Ser Met Val Thr Leu Thr Pro Leu Phe Ser Phe Ser Leu Leu Asn
1 5 10 15

Cys Thr Arg Lys Ala Ser Arg Ser Val Met Ser Ala Ser Ser Trp Leu
20 25 30

Val Thr Cys Gly Met Thr Thr Gln Leu Arg Cys Arg Phe Phe Asp Gly
35 40 45

Ile Ile Ser Ala Leu Arg Arg Val Thr His Tyr Trp Arg His Ile Met
50 55 60

Gln Thr Leu Ser Ile Gln His Gly Thr Leu Val Thr Met Asp Gln Tyr
65 70 75 80

Arg Arg Val Leu Gly Asp Ser Trp Val His Val Gln Asp Gly Arg Ile
85 90 95

Val Ala Leu Gly Val His Ala Glu Ser Val Pro Pro Pro Ala Asp Arg
100 105 110

Val Ile Asp Ala Arg Gly Lys Val Val Leu Pro Gly Phe Ile Asn Ala
115 120 125

His Thr His Val Asn Gln Ile Leu Leu Arg Gly Gly Pro Ser His Gly
130 135 140

Arg Gln Phe Tyr Asp Trp Leu Phe Asn Val Val Tyr Pro Gly Gln Lys
145 150 155 160

Ala Met Arg Pro Glu Asp Val Ala Val Ala Val Arg Leu Tyr Cys Ala
165 170 175

Glu Ala Val Arg Ser Gly Ile Thr Thr Ile Asn Glu Asn Ala Asp Ser
180 185 190

Ala Ile Tyr Pro Gly Asn Ile Glu Ala Ala Met Ala Val Tyr Gly Glu
195 200 205

Val Gly Val Arg Val Val Tyr Ala Arg Met Phe Phe Asp Arg Met Asp
210 215 220

Gly Arg Ile Gln Gly Tyr Val Asp Ala Leu Lys Ala Arg Ser Pro Gln
225 230 235 240

Val Glu Leu Cys Ser Ile Met Glu Gly Thr Ala Val Ala Lys Asp Arg
245 250 255

Ile Thr Ala Leu Ser Asp Gln Tyr His Gly Thr Ala Gly Gly Arg Ile
260 265 270

Ser Val Trp Pro Ala Pro Ala Thr Thr Thr Ala Val Thr Val Glu Gly
275 280 285

Met Arg Trp Ala Gln Ala Phe Ala Arg Asp Arg Ala Val Met Trp Thr
290 295 300

Leu His Met Ala Glu Ser Asp His Asp Glu Arg Ile His Gly Met Ser
305 310 315 320

Pro Ala Glu Tyr Met Glu Cys Tyr Gly Leu Leu Asp Glu Arg Leu Gln
325 330 335

Val Ala His Cys Val Tyr Phe Asp Arg Lys Asp Val Arg Leu Leu His
340 345 350

Arg His Asn Val Lys Val Ala Ser Gln Val Val Ser Asn Ala Tyr Leu
355 360 365

Gly Ser Gly Val Ala Pro Val Pro Glu Met Val Glu Arg Gly Met Ala
370 375 380

Val Gly Ile Gly Thr Asp Asn Gly Asn Ser Asn Asp Ser Val Asn Met
385 390 395 400

Ile Gly Asp Met Lys Phe Met Ala His Ile His Arg Ala Val His Arg
405 410 415

Asp Ala Asp Val Leu Thr Pro Glu Lys Ile Leu Glu Met Ala Thr Ile
420 425 430

Asp Gly Ala Arg Ser Leu Gly Met Asp His Glu Ile Gly Ser Ile Glu
435 440 445

Thr Gly Lys Arg Ala Asp Leu Ile Leu Leu Asp Leu Arg His Pro Gln
450 455 460

Thr Thr Pro His His His Leu Ala Ala Thr Ile Val Phe Gln Ala Tyr
465 470 475 480

Gly Asn Glu Val Asp Thr Val Leu Ile Asp Gly Asn Val Val Met Glu
485 490 495

Asn Arg Arg Leu Ser Phe Leu Pro Pro Glu Arg Glu Leu Ala Phe Leu
500 505 510

Glu Glu Ala Gln Ser Arg Ala Thr Ala Ile Leu Gln Arg Ala Asn Met
515 520 525

Val Ala Asn Pro Ala Trp Arg Ser Leu Glu Met Thr Pro Leu Leu His
530 535 540

Pro Pro Pro Leu Glu Glu Ile Ala Ala Ile Leu Ala Arg Leu Gly Leu
545 550 555 560

Gly Gly Gly His Asp Leu Asp Gly Tyr Arg Ile Ala Met Asn Ala Ala
565 570 575

Leu Pro Ser Phe Ala Arg Val Glu Ser Leu Val Gly Glu Gly Arg Leu
580 585 590

Arg Ala Pro Ala Ser Arg Arg Ser Glu
595 600

(2) INFORMATION FOR SEQ ID NO: 6:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 614 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6:



Ser Ala Ala Thr Ala Ala Leu Ile Met Lys Ala Ser Met Val Thr Leu
1 5 10 15

Thr Pro Leu Phe Ser Phe Ser Leu Leu Asn Cys Thr Arg Lys Ala Ser
20 25 30

Arg Ser Val Met Ser Ala Ser Ser Trp Leu Val Thr Cys Gly Met Thr
35 40 45

Thr Gln Leu Arg Cys Arg Phe Phe Asp Gly Val Ile Ser Ala Leu Arg
50 55 60

Arg Val Thr His Tyr Trp Arg His Ile Met Gln Thr Leu Ser Ile Gln
65 70 75 80

His Gly Thr Leu Val Thr Met Asp Gln Tyr Arg Arg Val Leu Gly Asp
85 90 95

Ser Trp Val His Val Gln Asp Gly Arg Ile Val Ala Leu Gly Val His
100 105 110

Ala Glu Ser Val Pro Pro Pro Ala Asp Arg Val Ile Asp Ala Arg Gly
115 120 125

Lys Val Val Leu Pro Gly Phe Ile Asn Ala His Thr His Val Asn Gln
130 135 140

Ile Leu Leu Arg Gly Gly Pro Ser His Gly Arg Gln Phe Tyr Asp Trp
145 150 155 160

Leu Phe Asn Val Val Tyr Pro Gly Gln Lys Ala Met Arg Pro Glu Asp
165 170 175

Val Ala Val Ala Val Arg Leu Tyr Cys Ala Glu Ala Val Arg Ser Gly
180 185 190

Ile Thr Thr Ile Asn Glu Asn Ala Asp Ser Ala Ile Tyr Pro Gly Asn
195 200 205

Ile Glu Ala Ala Met Ala Val Tyr Gly Glu Val Gly Val Arg Val Val
210 215 220

Tyr Ala Arg Met Phe Phe Asp Arg Met Asp Gly Arg Ile Gln Gly Tyr
225 230 235 240

Val Asp Ala Leu Lys Ala Arg Ser Pro Gln Val Glu Leu Cys Ser Ile
245 250 255

Met Glu Glu Thr Ala Val Ala Lys Asp Arg Ile Thr Ala Leu Ser Asp
260 265 270

Gln Tyr His Gly Thr Ala Gly Gly Arg Ile Ser Val Trp Pro Ala Pro
275 280 285

Ala Thr Thr Thr Ala Val Thr Val Glu Gly Met Arg Trp Ala Gln Ala
290 295 300

Phe Ala Arg Asp Arg Ala Val Met Trp Thr Leu His Met Ala Glu Ser
305 310 315 320

Asp His Asp Glu Arg Ile His Gly Met Ser Pro Ala Asp Tyr Met Glu
325 330 335

Cys Tyr Gly Leu Leu Asp Glu Arg Leu Gln Val Ala His Cys Val Tyr
340 345 350

Phe Asp Arg Lys Asp Val Arg Leu Leu His Arg His Asn Val Lys Val
355 360 365

Ala Ser Gln Val Val Ser Asn Ala Tyr Leu Gly Ser Gly Val Ala Pro
370 375 380

Val Pro Glu Met Val Glu Arg Gly Met Ala Val Gly Ile Gly Thr Asp
385 390 395 400

Asn Gly Asn Ser Asn Asp Ser Val Asn Met Ile Gly Asp Met Lys Phe
405 410 415

Met Ala His Ile His Arg Ala Val His Arg Asp Ala Asp Val Leu Thr
420 425 430

Pro Glu Lys Ile Leu Glu Met Ala Thr Ile Asp Gly Ala Arg Ser Leu
435 440 445

Gly Met Asp His Glu Ile Gly Ser Ile Glu Thr Gly Lys Arg Ala Asp
450 455 460

Leu Ile Leu Leu Asp Leu Arg His Pro Gln Thr Thr Pro His His His
465 470 475 480

Leu Ala Ala Thr Ile Val Phe Gln Ala Tyr Gly Asn Glu Val Asp Thr
485 490 495

Val Leu Ile Asp Gly Asn Val Val Met Glu Asn Arg Arg Leu Ser Phe
500 505 510

Leu Pro Pro Glu Arg Glu Leu Ala Phe Leu Glu Glu Ala Gln Ser Arg
515 520 525

Ala Thr Ala Ile Leu Gln Arg Ala Asn Met Val Ala Asn Pro Ala Trp
530 535 540

Arg Ser Leu Glu Met Thr Pro Leu Leu His Pro Pro Pro Leu Glu Glu
545 550 555 560

Ile Ala Ala Ile Leu Ala Arg Leu Gly Leu Gly Gly Gly His Asp Leu
565 570 575

Asp Gly Tyr Arg Ile Ala Met Asn Ala Ala Leu Pro Ser Phe Ala Arg
580 585 590

Val Glu Ser Leu Val Gly Glu Gly Arg Leu Arg Ala Pro Ala Ser Arg
595 600 605

Arg Ser Glu Arg Pro Glu
610



(2) INFORMATION FOR SEQ ID NO: 7:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 545 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7:

CGGTATCGGG GAATTCTTGA GCGCGGCCAC AGCAGCCNTG ATCATGAAGG CGAGCATGGT 60

GACCTNGACG CCGTNTTTTN GTTNTTTTTT GTTGAACTGC ACGCGAAAGG TTCCAGGTCG 120

GTGATGTCCG CGTCGTCGTG GTTGGTGACG TGCGGGATGA CCACCCAGNT GCGGTGCAGG 180

TTTTTCGATG GCATAATATC TGCGTTGCGA CGTGTAACAC ACTANTGGAG ACATATCATG 240

CAAACGCTCA GCATCCAGCA CGGTACCCTC GTCACGATGG ATCAGTACCG CAGAGTCCTT 300

GGGGATAGCT GGGTTCACGT GCAGGATGGA CGGATCGTCG CGCTCGGAGT GCACGCCGAG 360

TCGGTGCCTC CGCCAGCGGA TCGGGTGATC GATGCACGCG GCAAGGTCGT GTTACCCGGT 420

TTCATCAATG CCCACACCCA TGTGAACCAG ATCCTCCTGC GCGGAGGGCC CTCGCACGGG 480

CGTCAATTCT ATGACTGGCT GTTCAACGTT GTGTATCCGG GACAAAAGGC GATGAGACCG 540

GAGGA 545
(2) INFORMATION FOR SEQ ID NO: 8:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 499 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8:

CCTGCGCGGA GGGCCTCCGC ACGGGCGTCA ATTCTATGAC TGGCTGTTCA ACGTTGTGTA 60

TCCGGGACAA AAGGCGATGA GACCGGAGGA CGTAGCGGTG GCGGTGAGGT TGTATTGTGC 120

GGAAGCTGTG CGCAGCGGGA TTACGACGAT CAACGAAAAC GCCGATTCGG CCATCTACCC 180

AGGCAACATC GAGGCCGCGA TGGCGGTCTA TGGTGAGGTG GGTGTGAGGG TCGTCTACGC 240

CCGCATGTTC TTTGATCGGA TGGACGGGCG CATTCAAGGG TATGTGGACG CCTTGAAGGC 300

TCGCTCTCCC CAAGTCGAAC TGTGCTCGAT CATGGAGGAA ACGGCTGTGG CCAAAGATCG 360

GATCACAGCC CTGTCAGATC AGTATCATGG CACGGCAGGA GGTCCTATAT CAGTTTGGCC 420

CGCTCCTGCC ACTACCACGG CGGTGACATT TAAANGAATC CATGGGCCAA CCTCCCCCGT 480

GATCCGGCGG TAATGTGAC 499
(2) INFORMATION FOR SEQ ID NO: 9:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 360 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9:

TNGCAGGTTG TGAGCATGCT ACTTCGGTTC AGGNGTGGCC CCCGTGCCAG AGATGGTGGA 60

GCGCGGCATG GCCGTGGGCA TTGGAACAGA TAACGGGAAT AGTAATGACT CCGTAAACAT 120

GATCGGAGAC ATGAAGTTTA TGGCCCATAT TCACCGCGCG GTGCATCGGG ATGCGGACGT 180

GCTGACCCCA GAGAAGATTN TTGAAATGGC GACGATCGAT GGGGCGCGTT TCGTTGGGGA 240

TGGACCACGA GATTGGTTCC ATCGAAACCG GCAAGCGCGC GGACCTTATC CTGCTTGACC 300

TGCGTCACCC TCAGACGACT CCTCACCATC ATTTGGCGGC CACGATCGTG TTTCAGGCTT 360

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 443 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

CGGCCACGAT CGTGTTTCAG GCTTACGGCA ATGAGGTGGA CACTGTCCTG ATTGACGGAA 60

ACGTTGTGAT GGAGAACCGC CGCTTGAGCT TTCTTCCCCC TGAACGTGAG TTGGCGTTCC 120

TTGAGGAAGC GCAGAGCCGC GCCACAGCTA TTTTGCATCG GGCGAAACAT GGTGGCTAAC 180

CCAGCTTGGC GCAGCCTCTA GGAAATGACG CCGTTGCTGC ATCCGCCGCC CCTTGAGGAA 240

ATCGCTGCCA TCTTGGCGCG GCTCGGATTG GGGGGCGGAC ATGACCTTGA TGGATACAGA 300

ATTGCCATGA ATGCGGCACT TCCGTCCTTC GCTCGTGTGG AATCGTTGGT AGGTGAGGGT 360

CGACTGCGGG CGCCAGCTTC CCGAAGAGGT GAAAGCCCGA GGATCCTCTA GAGTCCGATT 420

TTTCCGATGT CATCACCGGC GCG 443
(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 505 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
CCTGCGCGGA GGCCTCCGCA CGGGCGTCAA TTCTATGACT GGCTGTTCAA CGTTGTGTAT 60

CCGGGACAAA AGGCGATGAG ACCGGAGGAC GTANCGGTGG CGGTGAGGTT GTATTGTGCG 120

GAAGCTGTGC GCAGCGGGAT TACGACGATC AACGAAAACG CCGATTCGGC CATCTACCCA 180

GGCAACATCG AGGCCGCGAT GGCGGTCTAT GGTGAGGTGG GTGTGAGGGT CGTCTACGCC 240

CGCATGTTCT TTGATCGGAT GGACGGGCGC ATTCAAGGGT ATGTGGACGC CTTGAAGGCT 300

CGCTCTCCCC AAGTCGAACT GTGCTCGATC ATGGAGGAAA CGGCTGTGGC CAAAGATCGG 360

ATCACANCCC TGTCAGATCA NTATCATGGC ACGGCANGAG GTCCTATATC ANTTTGGCCC 420

GCTCCTGCCA CTACCACNGC GGTGACATTT NAANGAATTC CATNGGCACA ACCTTCCCCC 480

GTGATCNGGC GGTAATGTNG ACCCA 505
(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 144 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Pro His Gly Arg Gln Phe Tyr Asp Trp Leu Phe Asn Val Leu Tyr Pro
1 5 10 15

Gly Gln Lys Ala Met Arg Pro Glu Asp Val Ala Val Ala Val Arg Leu
20 25 30

Tyr Cys Ala Glu Ala Val Arg Ser Gly Ile Thr Thr Ile Asn Glu Asn
35 40 45

Ala Asp Ser Ala Ile Tyr Pro Gly Asn Ile Glu Ala Ala Met Ala Val
50 55 60

Tyr Gly Glu Val Gly Val Arg Val Val Tyr Ala Arg Met Phe Phe Asp
65 70 75 80

Arg Met Asp Gly Arg Ile Gln Gly Tyr Val Asp Ala Leu Lys Ala Arg
85 90 95

Ser Pro Gln Val Glu Leu Cys Ser Ile Met Glu Glu Thr Ala Val Ala
100 105 110

Lys Asp Arg Ile Thr Ala Leu Ser Asp Gln Tyr His Gly Thr Ala Gly
115 120 125

Gly Arg Ile Ser Val Trp Pro Ala Pro Ala Thr Thr Thr Ala Val Thr
130 135 140



(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 144 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13:

Ser His Gly Arg Gln Phe Tyr Asp Trp Leu Phe Asn Val Leu Tyr Pro
1 5 10 15

Gly Gln Lys Ala Met Arg Pro Glu Asp Val Ala Val Ala Val Arg Leu
20 25 30

Tyr Cys Ala Glu Ala Val Arg Ser Gly Ile Thr Thr Ile Asn Glu Asn
35 40 45

Ala Asp Ser Ala Ile Tyr Pro Gly Asn Ile Glu Ala Ala Met Ala Val
50 55 60

Tyr Gly Glu Val Gly Val Arg Val Val Tyr Ala Arg Met Phe Phe Asp
65 70 75 80

Arg Met Asp Gly Arg Ile Gln Gly Tyr Val Asp Thr Leu Lys Ala Arg
85 90 95

Ser Pro Gln Val Glu Leu Cys Ser Ile Met Glu Glu Thr Ala Val Ala
100 105 110

Lys Asp Arg Ile Thr Ala Leu Ser Asp Gln Tyr His Gly Thr Ala Gly
115 120 125

Gly Arg Ile Ser Val Trp Pro Ala Pro Ala Thr Thr Thr Ala Val Thr
130 135 140



(2) INFORMATION FOR SEQ ID NO: 14:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 144 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

Pro His Gly Arg Gln Phe Tyr Asp Trp Leu Phe Asn Val Val Tyr Pro
1 5 10 15

Gly Gln Lys Ala Met Arg Pro Glu Asp Val Ala Val Ala Val Arg Leu
20 25 30

Tyr Cys Ala Glu Ala Val Arg Ser Gly Ile Thr Thr Ile Asn Glu Asn
35 40 45

Ala Asp Ser Ala Ile Tyr Pro Gly Asn Ile Glu Ala Ala Met Ala Val
50 55 60

Tyr Gly Glu Val Gly Val Arg Val Val Tyr Ala Arg Met Phe Phe Asp
65 70 75 80

Arg Met Asp Gly Arg Ile Gln Gly Tyr Val Asp Ala Leu Lys Ala Arg
85 90 95

Ser Pro Gln Val Glu Leu Cys Ser Ile Met Glu Glu Thr Ala Val Ala
100 105 110

Lys Asp Arg Ile Thr Ala Leu Ser Asp Gln Tyr His Gly Thr Ala Gly
115 120 125

Gly Arg Ile Ser Val Trp Pro Ala Pro Ala Thr Thr Thr Ala Val Thr
130 135 140



(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 145 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

Ser His Gly Arg Gln Phe Tyr Asp Trp Leu Phe Asn Val Leu Tyr Pro
1 5 10 15

Gly Gln Lys Ala Met Arg Pro Glu Asp Val Ala Val Ala Val Arg Leu
20 25 30

Tyr Cys Ala Glu Ala Val Arg Ser Gly Ile Thr Thr Ile Asn Glu Asn
35 40 45

Asn Ala Asp Ser Ala Ile Tyr Pro Gly Asn Ile Glu Ala Ala Met Ala
50 55 60

Val Tyr Gly Glu Val Gly Val Arg Val Val Tyr Ala Arg Met Phe Phe
65 70 75 80

Asp Arg Met Asp Gly Arg Ile Gln Gly Tyr Val Asp Thr Leu Lys Ala
85 90 95

Arg Ser Pro Gln Val Glu Leu Cys Ser Ile Met Glu Glu Thr Ala Val
100 105 110

Ala Lys Asp Arg Ile Thr Ala Leu Ser Asp Gln Tyr His Gly Thr Ala
115 120 125

Gly Gly Arg Ile Ser Val Trp Pro Ala Pro Ala Thr Thr Thr Ala Val
130 135 140

Thr
145



(2) INFORMATION FOR SEQ ID NO: 16: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 144 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16:

Ser His Gly Arg Gln Phe Tyr Asp Trp Leu Phe Asn Val Val Tyr Pro
1 5 10 15

Gly Gln Lys Ala Met Arg Pro Glu Asp Val Ala Val Ala Val Arg Leu
20 25 30

Tyr Cys Ala Glu Ala Val Arg Ser Gly Ile Thr Thr Ile Asn Glu Asn
35 40 45

Ala Asp Ser Ala Ile Tyr Pro Gly Asn Ile Glu Ala Ala Met Ala Val
50 55 60

Tyr Gly Glu Val Gly Val Arg Val Val Tyr Ala Arg Met Phe Phe Asp
65 70 75 80

Arg Met Asp Gly Arg Ile Gln Gly Tyr Val Asp Ala Leu Lys Ala Arg
85 90 95

Ser Pro Gln Val Glu Leu Cys Ser Ile Met Glu Glu Thr Ala Val Ala
100 105 110

Lys Asp Arg Ile Thr Ala Leu Ser Asp Gln Tyr His Gly Thr Ala Gly
115 120 125

Gly Arg Ile Ser Val Trp Pro Ala Pro Ala Thr Thr Thr Ala Val Thr
130 135 140



(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1633 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

CGCGAAAGGC TTCCAGGTCG GTGATGTCCG CGTCGTCGTG GTTGGTGACG TGCGGGATGA 6 0 

CCACCCAGTC GCGGTGCAGG TTTTTCGATG GCATAATATC TGCGTTGCGA CGTGTAACAC 12 0 

ACTATTGGAG ACATATCATG CAAACGCTCA GCATCCAGCA CGGTACCCTC GTCACGATGG 18 0 

ATCAATACCG CAGAGTCCTT GGGGATAGCT GGGTTCACGT GCAGGATGGA CGGATCGTCG 24 0 

CGCTCGGAGT GCACGCCAAG TCGGTGCCTC CGCCAGCGGA TCGGGTGATC GATGCACGCG 3 00 

GCAAGGTCGT GTTACCCGGT TTCATCAATG CCCACACCCA TGTGAACCAG ATCCTCCTGC 3 60 

GCGGAGGGCC CTCGCACGGG CGTCAATTCT ATGACTGGCT GTTCAACGTT GTGTATCCGG 42 0 

GACAAAAGGC GATGAGACCG GAGGACGTAG CGGTGGCGGT GAGGTTGTAT TGTGCGGAAG 48 0 

CTGTGCGCAG CGGGATTACG ACGATCAACG AAAACGCCGA TTCGGCCATC TACCCAGGCA 54 0 

ACATCGAGGC CGCGATGGCG GTCTATGGTG AGGTGGGTGT GAGGGTCGTC TACGCCCGCA 60 0 

TGTTCTTTGA TCGGATGGAC GGGCGCATTC AAGGGTATGT GGACGCCTTG AAGGCTCGCT 66 0 

CTCCCCAAGT CGAACTGTGC TCGATCATGG AGGAAACGGC TGTGGCCAAA GATCGGATCA 72 0 

CAGCCCTGTC AGATCAGTAT CATGGCACGG CAGGAGGTCG TATATCAGTT TGGCCCGCTC 78 0 

CTGCCACTAC CACGGCGGTG ACAGTTGAAG GAATGCGATG GGCACAAGCC TTCGCCCGTG 84 0 

ATCGGGCGGT AATGTGGACG CTTCACATGG CGGAGAGCGA TCATGATGGG CGGATTCATG 90 0 

GGATGAGTCC CGCCGAGTAC ATGGAGTGTT ACGGACTCTT GGATGAGCGT CTGCAGGTCG 96 0 

CGCATTGCGT GTACTTTGAC CGGAAGGATG TTCGGCTGCT GCACCGCCAC AATGTGAAGG 102 0 

TCGCGTCGCA GGTTGTGAGC AATGCCTACC TCGGCTCAGG GGTGGCCCCC GTGCCAGAGA 108 0 






TGGTGGAGCG CGGCATGGCC GTGGGCATTG GAACAGATAA CGGGAATAGT AATGACTCCG 1140 

TAAACATGAT CGGAGACATG AAGTTTATGG CCCATATTCA CCGCGCGGTG CATCGGGATG 1200 

CGGACGTGCT GACCCCAGAG AAGATTCTTG AAATGGCGAC GATCGATGGG GCGCGTTCGT 1260 

TGGGGATGGA CCACGAGATT GGTTCCATCG AAACCGGCAA GCGCGCGGAC CTTATCCTGC 1320 

TTGACCTGCG TCACCCTCAG ACGACTCCTC ACCATCATTT GGCGGCCACG ATCGTGTTTC 1380 

AGGCTTACGG CAATGAAGTG GACACTGTCC TGATTGACGG AAACGTTGTG ATGGAGAACC 144 0 

GCTGCTTGAG CTTTCTTCCC CCTGAACGTG AGTTGGCGTT CCTTGAGGGA GCGCAGAGCC 150 0 

GCGCCACAGC TATTTTGCAG CGGGCGAACA TGGTGGCTAA CCCAGCTTGG CGCAGCCTCT 1560 

AGGAAATGAC GCCGTTGCTG CATCCGCCGC CCCTTGAGGA AATCGCTGCC ATCTTGGCGC 1620 

GGCTCGGATT GGG 163 3 
(2) INFORMATION FOR SEQ ID NO:18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1598 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
TCGTGGTTGG TGACGTGCGG GATGACCACC CAGTCGCGGT GCAGGTTTTT CGATGGCATA 60 

ATATCTGCGT TGCGACGTGT AACACACTAT TGGAGACATA TCATGCAAAC GCTCAGCATC 120 

CAGCACGGTA CCCTCGTCAC GATGGATCAG TACCGCAGAG TCCTTGGGGA TAGCTGGGTT 18 0 

CACGTGCAGG ATGGACGGAT CGTCGCGCTC GGAGTGCACG CCGAGTCGGT GCCTCCGCCA 24 0 

GCGGATCGGG TGATCGATGC ACGCGGCAAG GTCGTGTTAC CCGGTTTCAT CAATGCCCAC 3 00 

ACCCATGTGA ACCAGATCCT CCTGCGCGGA GGGCCCTCGC ACGGGCGTCA ATTCTATGAC 3 60 

TGGCTGTTCA ACGTTGTGTA TCCGGGACAA AAGGCGATGA GACCGGAGGA CGTAGCGGTG 42 0 

GCGGTGAGGT TGTATTGTGC GGAAGCTGTG CGCAGCGGGA TTACGACGAT CAACGAAAAC 48 0 

GCCGATTCGG CCATCTACCC AGGCAACATC GAGGCCGCGA TGGCGGTCTA TGGTGAGGTG 54 0 

GGTGTGAGGG TCGTCTACGC CCGCATGTTC TTTGATCGGA TGGACGGGCG CATTCAAGGG 600 

TATGTGGACG CCTTGAAGGC TCGCTCTCCC CAAGTCGAAC TGTGCTCGAT CATGGAGGAA 660 

ACGGCTGTGG CCAAAGATCG GATCACAGCC CTGTCAGATC AGTATCATGG CACGGCAGGA 72 0 

GGTCGTATAT CAGTTTGGCC CGCTCCTGCC ACTACCACGG CGGTGACAGT TGAAGGAATG 78 0 



CGATGGGCAC AAGCCTTCGC CCGTGATCGG GCGGTAATGT GGACGCTTCA CATGGCGGAG 84 0 

AGCGATCATG ATGAGCGGAT TCATGGGATG AGTCCCGCCG AGTACATGGA GTGTCACGGA 90 0 

CTCTTGGATG AGCGTCTGCA GGTCGCGCAT TGCGTGTACT TTGACCGGAA GGATGTTCGG 96 0 

CTGCTGCACC GCCACAATGT GAAGGTCGCG TCGCAGGTTG TGAGCAATGC CTACCTCGGC 1020 

TCAGGGGTGG CCCCCGTGCC AGAGATGGTG GAGCGCGGCA TGGCCATGGG CATTGGAACA 108 0 

GATAACGGGA ATAGTAATGA CTCCGTAAAC ATGATCGGAG ACATGAAGTT TATGGCCCAT 114 0 

ATTCACCGCG CGGTGCATCG GGATGCGGAC GTGCTGACCC CAGAGAAGAT TCTTGAAATG 12 0 0 

GCGACGATCG ATGGGGCGCG TTCGTTGGGA ATGGACCACG AGATTGGTTC CATCGAAACC 1260 

GGCAAGCGCG CGGACCTTAT CCTGCTTGAC CTGCGTCACC CTCAGACGAC TCCTCACCAT 132 0 

CATTTGGCGG CCACGATCGT GTTTCAGGCT TACGGCAATG AGGTGGACAC TGTCCTGATT 13 8 0 

GACGGAAACG TTGTGATGGA GAACCGCCGC TTGAGCTTTC TTCCCCCTGA ACGTGAGTTG 144 0 

GCGTTCCTTG AGGAAGCGCA GAGCCGCGCC ACAGCTATTT TGCAGCGGGC GAACATGGTG 150 0 

GCTAACCCAG CTTGGCGCAG CCTCTAGGAA ATGACGCCGT TGCTGCATCC GCCGCCCCTT 156 0 

GAGGAAATCG CTGCCATCTT GGCGCGGCTC GGATTGGG 159 8 
(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1586 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19:

ACGTGCGGGA TGACCACCCA GTTGCGGTGC AGGTTTTTCG ATGGCGTAAT ATCTGCGTTG 60 

CGACGTGTAA CACACTATTG GAGACATATC ATGCAAACGC TCAGCATCCA GCACGGTACC 12 0 

CTCGTCACGA TGGATCAGTA CCGCAGAGTC CTTGGGGATA GCTGGGTTCA CGTGCAGGAT 180 

GGACGGATCG TCGCGCTCGG AGTGCACGCC GAGTCGGTGC CTCCGCCAGC GGATCGGGTG 24 0 

ATCGATGCAC GCGGCAAGGT CGTGTTACCC GGTTTCATCA ATGCCCACAC CCATGTGAAC 3 00 

CAGATCCTCC TGCGCGGAGG GCCCTCGCAC GGGCGTCAAT TCTATGACTG GCTGTTCAAC 3 60 

GTTGTGTATC CGGGACAAAA GGCGATGAGA CCTGAGGACG TAGCGGTGGC GGTGAGGTTG 42 0 

TATTGTGCGG AAGCTGTGCG CAGCGGGATT ACGACGATCA ACGAAAACGC CGATTCGGCC 480 

ATCTACCCAG GCAACATCGA GGCCGCGATG GCGGTCTATG GTGAGGTGGG TGTGAGGGTC 54 0 




GTCTACGCCC GCATGTTCTT TGATCGGATG GACGGGCGCA TTCAAGGGTA TGTGGACGCC 600 

TTGAAGGCTC GCTCTCCCCA AGTCGAACTG TGCTCGATCA TGGAGGAAAC GGCTGTGGCC 660 

AAAGATCGGA TCACAGCCCT GTCAGATCAG TATCATGGCA CGGCAGGAGG TCGTATATCA 72 0 

GTTTGGCCCG CTCCTGCCAC TACCACGGCG GTGACAGTTG AAGGAATGCG ATGGGCACAA 780 

GCCTTCGCCC GTGATCGGGC GGTAATGTGG ACGCTTCACA TGGCGGAGAG CGATCATGAT 84 0 

GAGCGGATTC ATGGGATGAG TCCCGCCGAG TACATGGAGT GTTACGGACT CTTGGATGAG 90 0 
CGTCTGCAGG TCGCGCATTG CGTGTACTTT GACCGGAAGG ATGTTCGGCT GCTGCACCGC ' 96 0 

CACAATGTGA AGGTCGCGTC GCAGGTTGTG AGCAATGCCT ACCTCGGCTC AGGGGTGGCC 102 0 

CCCGTGCCAG AGATGGTGGA GCGCGGCATG GCCGTGGGCA TTGGAACAGA TAACGGGAAT 1080 

AGTAATGACT CCGTAAACAT GATCGGAGAC ATGAAGTTTA TGGCCCATAT TCACCGCGCG 114 0 

GTGCATCGGG ATGCGGACGT GCTGACCCCA GAGAAGATTC TTGAAATGGC GACAATCGAT 12 0 0 

GGGGCGCGTT CGTTGGGAAT GGACCACGAG ATTGGTTCCA TCGAAACCGG CAAGCGCGCG 126 0 

GACCTTATCC TGCTTGACCT GCGTCACCCT CAGACGACTC CTCACCATCA TTTGGCGGCC 132 0 

ACGATCGTGT TTCAGGCTTA CGGCAATGAG GTGGACACTG TCCTGATTGA CGGAAACGTT 13 8 0 

GTGATGGAGA ACCGCCGCTT GAGCTTTCTT CCCCCTGAAC GTGAGTTGGC GTTCCTTGAG 144 0 

GAAGCGCAGA GCCGCGCCAC AGCTATTTTG CAGCGGGCGA ACATGGTGGC TAACCCAGCT 150 0 

TGGCGCAGCC TCTAGGAAAT GACGCCGTTG CTGCATCCGC TGCCCCTTGA GGAAATCGCT 1560 

GCCATCTTGG CGCGGCTCGG ATTGGG 158 6 
(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1597 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

CGTGGTTGGT GACGTGGGGG ATGACCACCC AGTCGCGGTG CAGGTTTTTC GATGGCATAA 60

TATCTGCGTT GCGACGTGTA ACACACTATT GGAGACATAT CATGCAAACG CTCAGCATCC 120

AGCACGGTAC CCTCGTCACG ATGGATCAGT ACCGCAGAGT CCTTGGGGAT AGCTGGGTTC 180

ACGTGCAGGA TGGACGGATC GTCGCGCTCG GAGTGCACGC CGAGTCGGTG CCTCCGCCAG 240

CGGATCAGGT GATCGATGCA CGCGGCAAGG TCGTGTTACC CGGTTTCATC AATGCCCACA 300

CCCATGTGAA CCAGATCCTG CTGCGCGGAG GGCCCTCGCA CGGGCGTCAA TTCCATGACT 360

GGCTGTTCAA CGTTGTGTAT CCGGGACAAA AGGCGATGAG ACCGGAGGAC GTAGCGGTGG 420

CGGTGAGGTT GTATTGTGCA GAAGCTGTGC GCAGCGGGAT TACGACGATT AACGAAAACG 480

CCGATTCGGC CATCTACCCA GGCAACATCG AGGCCGCGAT GGCGGTCTAT GGTGAGGTGG 540

GTGTGAGGGT CGTCTACGCC CGCATGTTCT TTGATCGGAT GGACGGGCGC ATTCAAGGGT 600

ATGTGGACGC CTTGAAGGCT CGCTCTCCCC AAGTCGAACT GTGCTCGATC ATGGAGGAAA 660

CGGCTGTGGC CAAAGATCGG ATCACAGCCC TGTCAGATCA GTATCATGGC ACGGCAGGAG 720

GTCGTATATC AGTTTGGCCC GCTCCTGCCA CTACCACGGC GGTGACAGTT GAAGGAATGC 780

GATGGGCACA AGCCTTCGCC CGTGATCGGG CGGTAATGTG GACGCTTCAC ATGGCGGAGA 840

GCGATCATGA TGGGCGGATT CATGGGATGA GTCCCGCCGA GTACATGGAG TGTTACGGAC 900

TCTTGGATGA GCGTCTGCAG GTCGCGCATT GCGTGTACTT TGACCGGAAG GATGTTCGGC 960

TGCTGCACCG CCACAATGTG AAGGTCGCGT CGCAGGTTGT GAGCAATGCC TACCTCGGCT 1020

CAGGGGTGGC CCCCGTGCCA GAGATGGTGG AGCGCGGCAT GGCCGTGGGC ATTGGAACAG 1080

ATAACGGGAA TAGTAATGAC TCCGTAAACA TGATCGGAGA CATGAAGTTT ATGGCCCATA 1140

TTCACCGCGC GGTGCATCGG GATGCGGACG TGCTGACCCC AGAGAAGATT CTTGAAATGG 1200

CAACGATCGA TGGGGCGCGT TCGTTGGGAA TGGACCACGA GATTGGTTCC ATCGAAACCG 1260

GCAAGCGCGC GGACCTTATC CTGCTTGACC TGCGTCACCC TCAGACGACT CCTCACCATC 1320

ATTTGGCGGC CACGATCGTG TTTCAGGCTT ACGGCAATGA GGTGGACACT GTCCTGATTG 1380

ACGGAAACGT TGTGATGGAG AACCGCCGCT TGAGCTTTCT TCCCCCTGAA CGTGAGTTGG 1440

CGTTCCTTGA GGAAGCGCAG AGCCGCGCCA CAGCTATTTT GCAGCGGGCG AACATGGTGG 1500

CTAACCCAGC TTGGCGCAGC CTCTAGGAAA TGACGCCGTT GCTGCATCCG CCGCCCCTTG 1560

AGGAAATCGC TGCCATCTTG GCGCGGCTCG GATTGGG 1597
(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1674 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

GTGACCTTGA CGCCGCTCTT TTCGTTCTCT TTGTTGAACT GCACGCGAAT GGCTTCCAGT 60

TCGATGATGT CCGCGTCGTC GTGGTTGGTG ACGTGCGGGA TGACCACCCA GTCGCGGTGC 120

AGGTTTTTCG ATGGCATAAT ATCTGCGTTG CGACGTGTAA CACACTATTG GAGACATATC 180

ATGCAAACGC TCAGCATCCA GCACGGTACC CTCGTCACGA TGGATCAGTA CCGCAGAGTC 240

CTTGGGGATA GCTGGGTTCA CGTGCAGGAT GGACGGATCG TCGCGCTCGG AGTGCACGCC 300

GAGTCGGTGC CTCCGCCAGC GGATCGGGTG ATTGATGCAC GCGGCAAGGT CGTGTTACCC 360

GGTTTCATCA ATGCCCACAC CCATGTGAAC CAGATCCTCC TGCGCGGAGG CCTCGCACGG 420

GCGTCAATTC TATGACTGGC TGTTCAACGT TGTGTATCCG GGACAAAAGG CGATGAGACC 480

GGAGGACGTA GCGGTGGCGG TGAGGTTGTA TTGTGCGGAA GCTGTGCGCA GCGGGATTAC 540

GACGATCAAC GAAAACGCCG ATTCGGCCAT CTACCCAGGC AACATCGAGG CCGCGATGGC 600

GGTCTATGGT GAGGTGGGTG TGAGGGTCGT CTACGCCCGC ATGTTCTTTG ATCGGATGGA 660

CAGGCGCATT CAAGGGTATG TGGACGCCTT GAAGGCTCGC TCTCCCCAAG TCGAACTGTG 720

CTCGATCATG GAGGAAACGG CTGTGGCCAA AGATCGGATC ACAGCCCTGT CAGATCAGTA 780

TCATGGCACG GCAGGAGGTC GTATATCAGT TTGGCCCGCT CCTGCCACTA CCACGGCGGT 840

GACAGTTGAA GGAATGCGAT GGGCACAAGC CTTCGCCCGT GATCGGGCGG TAATGTGGAC 900

GCTTCACATG GCGGAGAGCG ATCATGATGA GCGGATTCAT GGGATGAGTC CCGCCGAGTA 960

CATGGAGTGT TACGGACTCT TGGATGAGCG TCTGCAGGTC GCGCATTGCG TGTACTTTGA 1020

CCGGAAGGAT ATTCGGCTGC TGCACCGCCA CAATGTGAAG GTCGCGTCGC AGGCTGTGAG 1080

CAATGCCTAC CTCGGCTCAG GGGTGGCCCC CGTGCCAGAG ATGGTGGAGC GCGGCATGGC 1140

CGTGGGCATT GGAACAGATA ACGGGAATAG TAATGACTCC GTAAACATGA TCGGAGACAT 1200

GAAGTTTATG GCCCATATTC ACCGCGCGGT GCATCGGGAT GCGGACGTGC TGACCCCAGA 1260

GAAGATTCTT GAAATGGCGA CGATCGATGG GGCGCGTTCG TTGGGAATGG ACCACGAGAT 1320

TGGTTCCATC GAAACCGGCA AGCGCGCGGA CCTTATCCTG CTTGACCTGC GTCACCCTCA 1380

GACGACTCCT CACCATCATT TGGCGGCCAC GATCGTGTTT CAGGCTTACG GCAATGAGGT 1440

GGACACTGTC CTGATTGACG GAAACGTTGT GATGGAGAAC CGCCGCTTGA GCTTTCTTCC 1500

CCCTGAACGT GAGTTGGCGT TCCTTGAGGA AGCGCAGAGC CGCGCCACAG CTATTTTGCA 1560

GCGGGCGAAC ATGGTGGCCA ACCCAGCTTG GCGCAGCCTC TAGGAAATGA CGCCGTTGCT 1620

GCATCCGCCG CCCCTTGAGG AAATCGCTGC CATCTTGGCG CAGCTCGGAT TGGG 1674



(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 496 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

Met Gln Thr Leu Ser Ile Gln His Gly Thr Leu Val Thr Met Asp Gln
1 5 10 15

Tyr Arg Arg Val Leu Gly Asp Ser Trp Val His Val Gln Asp Gly Arg
20 25 30

Ile Val Ala Leu Gly Val His Ala Lys Ser Val Pro Pro Pro Ala Asp
35 40 45

Arg Val Ile Asp Ala Arg Gly Lys Val Val Leu Pro Gly Phe Ile Asn
50 55 60

Ala His Thr His Val Asn Gln Ile Leu Leu Arg Gly Gly Pro Ser His
65 70 75 80

Gly Arg Gln Phe Tyr Asp Trp Leu Phe Asn Val Val Tyr Pro Gly Gln
85 90 95

Lys Ala Met Arg Pro Glu Asp Val Ala Val Ala Val Arg Leu Tyr Cys
100 105 110

Ala Glu Ala Val Arg Ser Gly Ile Thr Thr Ile Asn Glu Asn Ala Asp
115 120 125

Ser Ala Ile Tyr Pro Gly Asn Ile Glu Ala Ala Met Ala Val Tyr Gly
130 135 140

Glu Val Gly Val Arg Val Val Tyr Ala Arg Met Phe Phe Asp Arg Met
145 150 155 160

Asp Gly Arg Ile Gln Gly Tyr Val Asp Ala Leu Lys Ala Arg Ser Pro
165 170 175

Gln Val Glu Leu Cys Ser Ile Met Glu Glu Thr Ala Val Ala Lys Asp
180 185 190

Arg Ile Thr Ala Leu Ser Asp Gln Tyr His Gly Thr Ala Gly Gly Arg
195 200 205

Ile Ser Val Trp Pro Ala Pro Ala Thr Thr Thr Ala Val Thr Val Glu
210 215 220

Gly Met Arg Trp Ala Gln Ala Phe Ala Arg Asp Arg Ala Val Met Trp
225 230 235 240

Thr Leu His Met Ala Glu Ser Asp His Asp Gly Arg Ile His Gly Met
245 250 255

Ser Pro Ala Glu Tyr Met Glu Cys Tyr Gly Leu Leu Asp Glu Arg Leu
260 265 270

Gln Val Ala His Cys Val Tyr Phe Asp Arg Lys Asp Val Arg Leu Leu
275 280 285

His Arg His Asn Val Lys Val Ala Ser Gln Val Val Ser Asn Ala Tyr
290 295 300

Leu Gly Ser Gly Val Ala Pro Val Pro Glu Met Val Glu Arg Gly Met
305 310 315 320

Ala Val Gly Ile Gly Thr Asp Asn Gly Asn Ser Asn Asp Ser Val Asn
325 330 335

Met Ile Gly Asp Met Lys Phe Met Ala His Ile His Arg Ala Val His
340 345 350

Arg Asp Ala Asp Val Leu Thr Pro Glu Lys Ile Leu Glu Met Ala Thr
355 360 365

Ile Asp Gly Ala Arg Ser Leu Gly Met Asp His Glu Ile Gly Ser Ile
370 375 380

Glu Thr Gly Lys Arg Ala Asp Leu Ile Leu Leu Asp Leu Arg His Pro
385 390 395 400

Gln Thr Thr Pro His His His Leu Ala Ala Thr Ile Val Phe Gln Ala
405 410 415

Tyr Gly Asn Glu Val Asp Thr Val Leu Ile Asp Gly Asn Val Val Met
420 425 430

Glu Asn Arg Cys Leu Ser Phe Leu Pro Pro Glu Arg Glu Leu Ala Phe
435 440 445

Leu Glu Gly Ala Gln Ser Arg Ala Thr Ala Ile Leu Gln Arg Ala Asn
450 455 460

Met Val Ala Asn Pro Ala Trp Arg Ser Leu Glu Met Thr Pro Leu Leu
465 470 475 480

His Pro Pro Pro Leu Glu Glu Ile Ala Ala Ile Leu Ala Arg Leu Gly
485 490 495



(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 496 amino acids 

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

Met Gln Thr Leu Ser Ile Gln His Gly Thr Leu Val Thr Met Asp Gln
1 5 10 15

Tyr Arg Arg Val Leu Gly Asp Ser Trp Val His Val Gln Asp Gly Arg
20 25 30

Ile Val Ala Leu Gly Val His Ala Glu Ser Val Pro Pro Pro Ala Asp
35 40 45

Arg Val Ile Asp Ala Arg Gly Lys Val Val Leu Pro Gly Phe Ile Asn
50 55 60

Ala His Thr His Val Asn Gln Ile Leu Leu Arg Gly Gly Pro Ser His
65 70 75 80

Gly Arg Gln Phe Tyr Asp Trp Leu Phe Asn Val Val Tyr Pro Gly Gln
85 90 95

Lys Ala Met Arg Pro Glu Asp Val Ala Val Ala Val Arg Leu Tyr Cys
100 105 110

Ala Glu Ala Val Arg Ser Gly Ile Thr Thr Ile Asn Glu Asn Ala Asp
115 120 125

Ser Ala Ile Tyr Pro Gly Asn Ile Glu Ala Ala Met Ala Val Tyr Gly
130 135 140

Glu Val Gly Val Arg Val Val Tyr Ala Arg Met Phe Phe Asp Arg Met
145 150 155 160

Asp Gly Arg Ile Gln Gly Tyr Val Asp Ala Leu Lys Ala Arg Ser Pro
165 170 175

Gln Val Glu Leu Cys Ser Ile Met Glu Glu Thr Ala Val Ala Lys Asp
180 185 190

Arg Ile Thr Ala Leu Ser Asp Gln Tyr His Gly Thr Ala Gly Gly Arg
195 200 205

Ile Ser Val Trp Pro Ala Pro Ala Thr Thr Thr Ala Val Thr Val Glu
210 215 220

Gly Met Arg Trp Ala Gln Ala Phe Ala Arg Asp Arg Ala Val Met Trp
225 230 235 240

Thr Leu His Met Ala Glu Ser Asp His Asp Glu Arg Ile His Gly Met
245 250 255

Ser Pro Ala Glu Tyr Met Glu Cys His Gly Leu Leu Asp Glu Arg Leu
260 265 270

Gln Val Ala His Cys Val Tyr Phe Asp Arg Lys Asp Val Arg Leu Leu
275 280 285

His Arg His Asn Val Lys Val Ala Ser Gln Val Val Ser Asn Ala Tyr
290 295 300

Leu Gly Ser Gly Val Ala Pro Val Pro Glu Met Val Glu Arg Gly Met
305 310 315 320

Ala Met Gly Ile Gly Thr Asp Asn Gly Asn Ser Asn Asp Ser Val Asn
325 330 335

Met Ile Gly Asp Met Lys Phe Met Ala His Ile His Arg Ala Val His
340 345 350

Arg Asp Ala Asp Val Leu Thr Pro Glu Lys Ile Leu Glu Met Ala Thr
355 360 365

Ile Asp Gly Ala Arg Ser Leu Gly Met Asp His Glu Ile Gly Ser Ile
370 375 380

Glu Thr Gly Lys Arg Ala Asp Leu Ile Leu Leu Asp Leu Arg His Pro
385 390 395 400

Gln Thr Thr Pro His His His Leu Ala Ala Thr Ile Val Phe Gln Ala
405 410 415

Tyr Gly Asn Glu Val Asp Thr Val Leu Ile Asp Gly Asn Val Val Met
420 425 430

Glu Asn Arg Arg Leu Ser Phe Leu Pro Pro Glu Arg Glu Leu Ala Phe
435 440 445

Leu Glu Glu Ala Gln Ser Arg Ala Thr Ala Ile Leu Gln Arg Ala Asn
450 455 460

Met Val Ala Asn Pro Ala Trp Arg Ser Leu Glu Met Thr Pro Leu Leu
465 470 475 480

His Pro Pro Pro Leu Glu Glu Ile Ala Ala Ile Leu Ala Arg Leu Gly
485 490 495



(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 496 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 



Met Gln Thr Leu Ser Ile Gln His Gly Thr Leu Val Thr Met Asp Gln
1 5 10 15

Tyr Arg Arg Val Leu Gly Asp Ser Trp Val His Val Gln Asp Gly Arg
20 25 30

Ile Val Ala Leu Gly Val His Ala Glu Ser Val Pro Pro Pro Ala Asp
35 40 45

Arg Val Ile Asp Ala Arg Gly Lys Val Val Leu Pro Gly Phe Ile Asn
50 55 60

Ala His Thr His Val Asn Gln Ile Leu Leu Arg Gly Gly Pro Ser His
65 70 75 80

Gly Arg Gln Phe Tyr Asp Trp Leu Phe Asn Val Val Tyr Pro Gly Gln
85 90 95

Lys Ala Met Arg Pro Glu Asp Val Ala Val Ala Val Arg Leu Tyr Cys
100 105 110

Ala Glu Ala Val Arg Ser Gly Ile Thr Thr Ile Asn Glu Asn Ala Asp
115 120 125

Ser Ala Ile Tyr Pro Gly Asn Ile Glu Ala Ala Met Ala Val Tyr Gly
130 135 140

Glu Val Gly Val Arg Val Val Tyr Ala Arg Met Phe Phe Asp Arg Met
145 150 155 160

Asp Gly Arg Ile Gln Gly Tyr Val Asp Ala Leu Lys Ala Arg Ser Pro
165 170 175

Gln Val Glu Leu Cys Ser Ile Met Glu Glu Thr Ala Val Ala Lys Asp
180 185 190

Arg Ile Thr Ala Leu Ser Asp Gln Tyr His Gly Thr Ala Gly Gly Arg
195 200 205

Ile Ser Val Trp Pro Ala Pro Ala Thr Thr Thr Ala Val Thr Val Glu
210 215 220

Gly Met Arg Trp Ala Gln Ala Phe Ala Arg Asp Arg Ala Val Met Trp
225 230 235 240

Thr Leu His Met Ala Glu Ser Asp His Asp Glu Arg Ile His Gly Met
245 250 255

Ser Pro Ala Glu Tyr Met Glu Cys Tyr Gly Leu Leu Asp Glu Arg Leu
260 265 270

Gln Val Ala His Cys Val Tyr Phe Asp Arg Lys Asp Val Arg Leu Leu
275 280 285

His Arg His Asn Val Lys Val Ala Ser Gln Val Val Ser Asn Ala Tyr
290 295 300

Leu Gly Ser Gly Val Ala Pro Val Pro Glu Met Val Glu Arg Gly Met
305 310 315 320

Ala Val Gly Ile Gly Thr Asp Asn Gly Asn Ser Asn Asp Ser Val Asn
325 330 335

Met Ile Gly Asp Met Lys Phe Met Ala His Ile His Arg Ala Val His
340 345 350

Arg Asp Ala Asp Val Leu Thr Pro Glu Lys Ile Leu Glu Met Ala Thr
355 360 365

Ile Asp Gly Ala Arg Ser Leu Gly Met Asp His Glu Ile Gly Ser Ile
370 375 380

Glu Thr Gly Lys Arg Ala Asp Leu Ile Leu Leu Asp Leu Arg His Pro
385 390 395 400

Gln Thr Thr Pro His His His Leu Ala Ala Thr Ile Val Phe Gln Ala
405 410 415

Tyr Gly Asn Glu Val Asp Thr Val Leu Ile Asp Gly Asn Val Val Met
420 425 430

Glu Asn Arg Arg Leu Ser Phe Leu Pro Pro Glu Arg Glu Leu Ala Phe
435 440 445

Leu Glu Glu Ala Gln Ser Arg Ala Thr Ala Ile Leu Gln Arg Ala Asn
450 455 460

Met Val Ala Asn Pro Ala Trp Arg Ser Leu Glu Met Thr Pro Leu Leu
465 470 475 480

His Pro Leu Pro Leu Glu Glu Ile Ala Ala Ile Leu Ala Arg Leu Gly
485 490 495



(2) INFORMATION FOR SEQ ID NO: 25:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 496 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

Met Gln Thr Leu Ser Ile Gln His Gly Thr Leu Val Thr Met Asp Gln
1 5 10 15

Tyr Arg Arg Val Leu Gly Asp Ser Trp Val His Val Gln Asp Gly Arg
20 25 30

Ile Val Ala Leu Gly Val His Ala Glu Ser Val Pro Pro Pro Ala Asp
35 40 45

Gln Val Ile Asp Ala Arg Gly Lys Val Val Leu Pro Gly Phe Ile Asn
50 55 60

Ala His Thr His Val Asn Gln Ile Leu Leu Arg Gly Gly Pro Ser His
65 70 75 80

Gly Arg Gln Phe His Asp Trp Leu Phe Asn Val Val Tyr Pro Gly Gln
85 90 95

Lys Ala Met Arg Pro Glu Asp Val Ala Val Ala Val Arg Leu Tyr Cys
100 105 110

Ala Glu Ala Val Arg Ser Gly Ile Thr Thr Ile Asn Glu Asn Ala Asp
115 120 125

Ser Ala Ile Tyr Pro Gly Asn Ile Glu Ala Ala Met Ala Val Tyr Gly
130 135 140

Glu Val Gly Val Arg Val Val Tyr Ala Arg Met Phe Phe Asp Arg Met
145 150 155 160

Asp Gly Arg Ile Gln Gly Tyr Val Asp Ala Leu Lys Ala Arg Ser Pro
165 170 175

Gln Val Glu Leu Cys Ser Ile Met Glu Glu Thr Ala Val Ala Lys Asp
180 185 190

Arg Ile Thr Ala Leu Ser Asp Gln Tyr His Gly Thr Ala Gly Gly Arg
195 200 205

Ile Ser Val Trp Pro Ala Pro Ala Thr Thr Thr Ala Val Thr Val Glu
210 215 220

Gly Met Arg Trp Ala Gln Ala Phe Ala Arg Asp Arg Ala Val Met Trp
225 230 235 240

Thr Leu His Met Ala Glu Ser Asp His Asp Gly Arg Ile His Gly Met
245 250 255

Ser Pro Ala Glu Tyr Met Glu Cys Tyr Gly Leu Leu Asp Glu Arg Leu
260 265 270

Gln Val Ala His Cys Val Tyr Phe Asp Arg Lys Asp Val Arg Leu Leu
275 280 285

His Arg His Asn Val Lys Val Ala Ser Gln Val Val Ser Asn Ala Tyr
290 295 300

Leu Gly Ser Gly Val Ala Pro Val Pro Glu Met Val Glu Arg Gly Met
305 310 315 320

Ala Val Gly Ile Gly Thr Asp Asn Gly Asn Ser Asn Asp Ser Val Asn
325 330 335

Met Ile Gly Asp Met Lys Phe Met Ala His Ile His Arg Ala Val His
340 345 350

Arg Asp Ala Asp Val Leu Thr Pro Glu Lys Ile Leu Glu Met Ala Thr
355 360 365

Ile Asp Gly Ala Arg Ser Leu Gly Met Asp His Glu Ile Gly Ser Ile
370 375 380

Glu Thr Gly Lys Arg Ala Asp Leu Ile Leu Leu Asp Leu Arg His Pro
385 390 395 400

Gln Thr Thr Pro His His His Leu Ala Ala Thr Ile Val Phe Gln Ala
405 410 415

Tyr Gly Asn Glu Val Asp Thr Val Leu Ile Asp Gly Asn Val Val Met
420 425 430

Glu Asn Arg Arg Leu Ser Phe Leu Pro Pro Glu Arg Glu Leu Ala Phe
435 440 445

Leu Glu Glu Ala Gln Ser Arg Ala Thr Ala Ile Leu Gln Arg Ala Asn
450 455 460

Met Val Ala Asn Pro Ala Trp Arg Ser Leu Glu Met Thr Pro Leu Leu
465 470 475 480

His Pro Pro Pro Leu Glu Glu Ile Ala Ala Ile Leu Ala Arg Leu Gly
485 490 495



(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 496 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

Met Gln Thr Leu Ser Ile Gln His Gly Thr Leu Val Thr Met Asp Gln
1 5 10 15

Tyr Arg Arg Val Leu Gly Asp Ser Trp Val His Val Gln Asp Gly Arg
20 25 30

Ile Val Ala Leu Gly Val His Ala Glu Ser Val Pro Pro Pro Ala Asp
35 40 45

Arg Val Ile Asp Ala Arg Gly Lys Val Val Leu Pro Gly Phe Ile Asn
50 55 60

Ala His Thr His Val Asn Gln Ile Leu Leu Arg Gly Gly Pro Ser His
65 70 75 80

Gly Arg Gln Phe Tyr Asp Trp Leu Phe Asn Val Val Tyr Pro Gly Gln
85 90 95

Lys Ala Met Arg Pro Glu Asp Val Ala Val Ala Val Arg Leu Tyr Cys
100 105 110

Ala Glu Ala Val Arg Ser Gly Ile Thr Thr Ile Asn Glu Asn Ala Asp
115 120 125

Ser Ala Ile Tyr Pro Gly Asn Ile Glu Ala Ala Met Ala Val Tyr Gly
130 135 140

Glu Val Gly Val Arg Val Val Tyr Ala Arg Met Phe Phe Asp Arg Met
145 150 155 160

Asp Arg Arg Ile Gln Gly Tyr Val Asp Ala Leu Lys Ala Arg Ser Pro
165 170 175

Gln Val Glu Leu Cys Ser Ile Met Glu Glu Thr Ala Val Ala Lys Asp
180 185 190

Arg Ile Thr Ala Leu Ser Asp Gln Tyr His Gly Thr Ala Gly Gly Arg
195 200 205

Ile Ser Val Trp Pro Ala Pro Ala Thr Thr Thr Ala Val Thr Val Glu
210 215 220

Gly Met Arg Trp Ala Gln Ala Phe Ala Arg Asp Arg Ala Val Met Trp
225 230 235 240

Thr Leu His Met Ala Glu Ser Asp His Asp Glu Arg Ile His Gly Met
245 250 255

Ser Pro Ala Glu Tyr Met Glu Cys Tyr Gly Leu Leu Asp Glu Arg Leu
260 265 270

Gln Val Ala His Cys Val Tyr Phe Asp Arg Lys Asp Ile Arg Leu Leu
275 280 285

His Arg His Asn Val Lys Val Ala Ser Gln Ala Val Ser Asn Ala Tyr
290 295 300

Leu Gly Ser Gly Val Ala Pro Val Pro Glu Met Val Glu Arg Gly Met
305 310 315 320

Ala Val Gly Ile Gly Thr Asp Asn Gly Asn Ser Asn Asp Ser Val Asn
325 330 335

Met Ile Gly Asp Met Lys Phe Met Ala His Ile His Arg Ala Val His
340 345 350

Arg Asp Ala Asp Val Leu Thr Pro Glu Lys Ile Leu Glu Met Ala Thr
355 360 365

Ile Asp Gly Ala Arg Ser Leu Gly Met Asp His Glu Ile Gly Ser Ile
370 375 380

Glu Thr Gly Lys Arg Ala Asp Leu Ile Leu Leu Asp Leu Arg His Pro
385 390 395 400

Gln Thr Thr Pro His His His Leu Ala Ala Thr Ile Val Phe Gln Ala
405 410 415

Tyr Gly Asn Glu Val Asp Thr Val Leu Ile Asp Gly Asn Val Val Met
420 425 430

Glu Asn Arg Arg Leu Ser Phe Leu Pro Pro Glu Arg Glu Leu Ala Phe
435 440 445

Leu Glu Glu Ala Gln Ser Arg Ala Thr Ala Ile Leu Gln Arg Ala Asn
450 455 460

Met Val Ala Asn Pro Ala Trp Arg Ser Leu Glu Met Thr Pro Leu Leu
465 470 475 480

His Pro Pro Pro Leu Glu Glu Ile Ala Ala Ile Leu Ala Gln Leu Gly
485 490 495



